1. Introduction



We are interested in giving meaning, within Kolmogorov's axiomatics, to nonclassical
versions of the total probability formula. This problem was considered in Khrennikov,
2001a, 2001b, at the physical level of rigor, using the frequency approach, see von Mises,
1964.

Almost any treatise on quantum mechanics dwells on the fundamental two-slit experiment
with, e.g., electrons passing through one or two open slits (see, e.g., d'Espagnat,
1999, p. 6). Let $A_i$ denote the event of passing through the slit with label $i$, where
$i = 1, 2$. Interpretation of the results of this experiment and the related ones (see, e.g.,
Feynman and Hibbs, 1965, p. 11) has led to the following formula for the probability of an
electron passing through the two open slits:



$$P(A_1 \cup A_2) = P(A_1) + P(A_2) + 2\sqrt{P(A_1)P(A_2)}\,\cos\theta \tag{1}$$

where $P$ is a symbol of probability and $\theta$ is a certain parameter. Usually the origin of
the term $2\sqrt{P(A_1)P(A_2)}\cos\theta$ is explained by referring to the "self-interference"
inherent in the "wave nature" of an electron. To be more precise, calculating relative
frequencies of the considered events after repetitions of the experiment led to the conclusion
that the classical formula of addition for probabilities of disjoint events should be modified.

A remarkable achievement of physics was the employment of Hilbert space methods
for the description of quantum mechanical systems. In the standard formalism (see, e.g.,
d'Espagnat, 1995, p. 63-65) one derives formula (1) with the help of the superposition
principle. This elegant proof, using complex-valued coefficients of a "state-vector"
decomposition with respect to the basis of eigenvectors of a self-adjoint operator
corresponding to a certain "observable", rather loses links with traditional concepts of
probability theory. Moreover, a number of specialists in quantum mechanics believe that
within the Kolmogorov axiomatics it is impossible to justify formula (1). The discussion has
continued from the early days of the subject and a common viewpoint has still not emerged,
see, e.g., Heisenberg, 1930, Dirac, 1930, Bohr, 1934, von Neumann, 1955, Feynman and Hibbs,
1965, Jauch, 1968, Beltrametti and Cassinelli, 1981, Holevo, 1980, 2001, Accardi, 1984,
Ballentine, 1986, Parthasarathy, 1992, Meyer, 1993, Peres, 1994, d'Espagnat, 1995, 1999,
Busch et al., 1995.

The careful analysis of probability spaces and families of random variables used for
models of stochastic experiments gives the key to understanding the origin of formulas
like (1) (cf. Khrennikov, 1999, p. 100-102, 2000, p. 5936, Thorisson, 2000, p. 29, Accardi
and Regoli, 2001, p. 8-9). The main role here is played by the following contextualism
principle¹: the introduction of a random variable or a collection of random variables
describing some properties of a physical system or an ensemble of systems should take into
account the total collection (class) of the conditions, i.e. the context, under which the
values of these variables (or their distributions) were specified.

In fact the contextualism principle can also be found in the fundamental work by
Kolmogorov, 1933, or in his paper, 1965 (p. 249): "Thus, to say that an event $A$ is
"random" or "stochastic" and to assign it a definite probability $p = P(A|S)$ is possible
only when we have already determined the class of allowable ways of setting up the series
of experiments. The nature of this class will be assumed to be included in the conditions
$S$". So, the role of a specified class of conditions which can be reproduced is stressed (see
details in Gnedenko, 1962, Shiryaev, 1998).

The aim of this paper is to show that a simple stochastic model of measurement
of characteristics of some physical systems provides a quite natural explanation for the
"interference" term arising in formula (1). Let us remark once more that besides
the probability space $(\Omega, \mathcal{F}, P)$ the essential role in the construction of
probabilistic models is played by the families of random variables used for the description
of the studied phenomena. Of course, having specified the model, one can consider the new
(canonical) probability space with coordinate random variables.

It should be emphasized that we are going neither to analyze concrete quantum mechanical
systems nor to impose on them a classical probability space $(\Omega, \mathcal{F}, P)$. Our
goal is not to transform the physical phenomena into "classical" probability events, i.e.
elements of a $\sigma$-algebra $\mathcal{F}$ of subsets of a space $\Omega$, where $P$ is a
measure defined on events and normalized so that $P(\Omega) = 1$. A deep study of the quantum
probability problems can be found, e.g., in Accardi, 1984, 2001, Holevo, 1980, 2001, Meyer,
1993, Parthasarathy, 1992.

In the sequel it will be convenient to give another form to relation (1). Set
$C = A_1 \cup A_2$, where $A_1 \cap A_2 = \emptyset$, and rewrite (1) as a "nonclassical total
probability formula":

$$P(C) = P(C|A_1)P(A_1) + P(C|A_2)P(A_2) + 2\sqrt{P(C|A_1)P(A_1)P(C|A_2)P(A_2)}\,\cos\theta, \tag{2}$$

where, as usual,

$$P(C|A_i) = P(CA_i)/P(A_i)$$

and $P(A_i) > 0$, $i = 1, 2$. The trivial case $P(A_i) = 0$ for some $i$ is henceforth excluded.

¹ In quantum theory this principle was introduced by N. Bohr, who underlined that experimental
conditions play the crucial role in determining physical observables, see e.g. Bohr, 1934.
² The choice of the principal features (conditions) leads to various models.



2. The model description 

For the sake of simplicity consider some physical system $S$ possessing only properties³
$A$ and $C$ described by a pair of two-valued random variables ($A$ and $C$ "characteristics").
Suppose that there is a procedure permitting one to reproduce the copies $S_1, S_2, \ldots$ of
$S$ to get a statistical ensemble of $S$ in the following sense. Let
$(a^{(1)}, c^{(1)}), (a^{(2)}, c^{(2)}), \ldots$ be a sequence of i.i.d. random vectors defined
on some probability space $(\Omega, \mathcal{F}, P)$. Assume that for some
$a_i, c_i \in \mathbb{R}$, $i = 1, 2$,

$$P(a^{(1)} = a_1) = p_1^a, \quad P(a^{(1)} = a_2) = p_2^a, \tag{3}$$
$$P(c^{(1)} = c_1) = p_1^c, \quad P(c^{(1)} = c_2) = p_2^c, \tag{4}$$

where $p_1^a + p_2^a = 1$ and $p_1^c + p_2^c = 1$.

For $j \in \mathbb{N}$ and an elementary event $\omega \in \Omega$ the vector
$(a^{(j)}(\omega), c^{(j)}(\omega))$ characterizes the properties $A$ and $C$ of the system
$S_j$. The independence assumption for the sequence $\{(a^{(j)}, c^{(j)})\}_{j \ge 1}$ reflects
the usual noninteraction hypothesis for elements of a stochastic ensemble. Further we can use
more general assumptions as well. For each $j \in \mathbb{N}$ no conditions are imposed on the
joint distribution of the random variables $a^{(j)}$ and $c^{(j)}$. Moreover, note that for any
$\omega \in \Omega$ the sequence $\{(a^{(j)}(\omega), c^{(j)}(\omega))\}_{j \ge 1}$ is not, in
general, available. In fact, to get the data certain measurement procedures should be used.

Suppose that it is possible to apply to systems $S_j$ a measurement procedure $\mathcal{M}_A$
permitting one, for each $\omega \in \Omega$, to fix the value of the characteristic describing
property $A$. In other words, for every $\omega \in \Omega$ we have integers
$1 \le k_1(\omega) < k_2(\omega) < \ldots$ such that

$$a^{(k_1(\omega))}(\omega) = a_1, \quad a^{(k_2(\omega))}(\omega) = a_1, \ \ldots$$

and, respectively,

$$a^{(m_j(\omega))}(\omega) = a_2 \quad \text{for } m_1(\omega) < m_2(\omega) < \ldots$$

where $m_j(\omega) \in \mathbb{N} \setminus \bigcup_j \{k_j(\omega)\}$.

It is worth noting that after the procedure $\mathcal{M}_A$ the property $C$ of the initial
systems $S_j$ in general can be changed⁴. Thus, new systems $\tilde{S}_j$ ($j \in \mathbb{N}$)
arise. To describe the property $C$ of these systems it is natural to use random sequences
$\{\tilde{c}^{(j)}\}_{j \ge 1}$ and $\{\hat{c}^{(j)}\}_{j \ge 1}$ such that

$$P(\tilde{c}^{(j)} = c_r) = \tilde{p}_r^c, \quad P(\hat{c}^{(j)} = c_r) = \hat{p}_r^c, \quad j \in \mathbb{N},\ r = 1, 2, \tag{5}$$

where $\tilde{p}_1^c + \tilde{p}_2^c = 1$, $\hat{p}_1^c + \hat{p}_2^c = 1$.

³ On the notion of a property see, e.g., Khrennikov, 1999, p. 59.
⁴ This is very natural from the quantum viewpoint.

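Assume that for each $\omega \in \Omega$ and $i = k_j(\omega) \in \mathbb{N}$ such that
$a^{(i)}(\omega) = a_1$ the value $\tilde{c}^{(i)}(\omega)$ appears instead of
$c^{(i)}(\omega)$. Other values $c^{(i)}(\omega)$ (for
$i = i(\omega) \in \mathbb{N} \setminus \bigcup_j \{k_j(\omega)\}$) are replaced
by $\hat{c}^{(i)}(\omega)$.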

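So, to describe the properties $A$ and $C$ for the ensemble of systems $\tilde{S}_j$
($j \in \mathbb{N}$) we obtain two-dimensional sequences

$$\{(a_1, \tilde{c}^{(k_j(\omega))}(\omega))\}_{j \ge 1} \quad \text{and} \quad \{(a_2, \hat{c}^{(m_j(\omega))}(\omega))\}_{j \ge 1}.$$

Note that since $k_j = k_j(\omega)$ and $m_j = m_j(\omega)$ are random variables defined on the
probability space $(\Omega, \mathcal{F}, P)$ the same is true of
$\tilde{c}^{(k_j)} = \tilde{c}^{(k_j(\omega))}(\omega)$ and
$\hat{c}^{(m_j)} = \hat{c}^{(m_j(\omega))}(\omega)$.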

It is always possible to construct a probability space $(\Omega, \mathcal{F}, P)$ where
stochastic sequences $\{(a^{(j)}, c^{(j)})\}_{j \ge 1}$, $\{\tilde{c}^{(j)}\}_{j \ge 1}$,
$\{\hat{c}^{(j)}\}_{j \ge 1}$ with the above-mentioned properties are defined. Moreover, using
the standard product of probability spaces it is easy to guarantee the independence of terms
of these sequences and independence of the collection of three sequences as well. However,
we do not use the last opportunity.

Suppose also that a measurement procedure $\mathcal{M}_C$ can be applied to systems
$\tilde{S}_j$ ($j \in \mathbb{N}$) to fix the values of the characteristics of property $C$.
Thus we get systems $S'_i$ ($i \in \mathbb{N}$) having $\tilde{c}^{(k_j(\omega))}(\omega)$ or
$\hat{c}^{(m_j(\omega))}(\omega)$ as $C$-characteristics for $\omega \in \Omega$ and
$i = k_j(\omega)$ or $i = m_j(\omega)$. Of course, the $A$-characteristics of systems $S'_j$
($j \in \mathbb{N}$) in general need not be conserved.
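
To make the construction more tangible, here is a minimal Python sketch of a finite ensemble. It rests on assumptions that go beyond the model described above: all sequences are drawn independently of each other, the two possible values of each characteristic are encoded by the labels 1 and 2, and the names `draw`, `simulate`, `p_a1`, `p_c1`, `p_after_a1`, `p_after_a2` are purely illustrative. The sketch mimics the first measurement procedure by splitting the system labels into those showing the first value of the A-characteristic and those showing the second one, and then substitutes the modified C-value for the initial one.

```python
import random


def draw(p_first, rng):
    """Return label 1 with probability p_first, otherwise label 2."""
    return 1 if rng.random() < p_first else 2


def simulate(n, p_a1, p_c1, p_after_a1, p_after_a2, seed=0):
    """Simulate n systems (illustrative assumption: everything is independent).

    p_a1       -- probability that the A-characteristic takes its first value
    p_c1       -- probability that the initial C-characteristic takes its first value
    p_after_a1 -- same for the modified C-value of systems where A showed value 1
    p_after_a2 -- same for the modified C-value of systems where A showed value 2
    """
    rng = random.Random(seed)
    a = [draw(p_a1, rng) for _ in range(n)]
    c = [draw(p_c1, rng) for _ in range(n)]
    c_mod_1 = [draw(p_after_a1, rng) for _ in range(n)]
    c_mod_2 = [draw(p_after_a2, rng) for _ in range(n)]

    # Measurement of A: 1-based labels of systems with the first / second value.
    k = [i + 1 for i, a_i in enumerate(a) if a_i == 1]
    m = [i + 1 for i, a_i in enumerate(a) if a_i == 2]

    # After the measurement of A the initial C-value is replaced.
    c_after = [c_mod_1[i] if a[i] == 1 else c_mod_2[i] for i in range(n)]
    return a, c, c_after, k, m


if __name__ == "__main__":
    a, c, c_after, k, m = simulate(10, 0.3, 0.5, 0.2, 0.9)
    print("labels with first A-value: ", k)
    print("labels with second A-value:", m)
    print("C-values after measuring A:", c_after)
```

Note that the initial C-values in `c` are generated but, in line with the model, they are not what a subsequent measurement of C would reveal; only `c_after` is observable once A has been measured.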



3. Nonclassical total probability formula 

Now we consider, from the viewpoint of limit theorems for random variables defined on
$(\Omega, \mathcal{F}, P)$, the approach used in Khrennikov, 2001a, 2001b. Namely, we discuss
statistical properties of $C$-characteristics for ensembles of systems $\{S_j\}_{j \ge 1}$ and
$\{\tilde{S}_j\}_{j \ge 1}$. In this regard it is useful to recall the following statement from
d'Espagnat, 1999 (see p. 15):
"Quantum mechanics is essentially a statistical theory. Except in special cases it makes
no prediction that bears on individual systems. Rather, it predicts statistical frequencies.
In other words, it predicts as a rule, the number n of times that a given event will be
observed when a large number of physical systems of the same type and satisfying
specified conditions are subjected to a given measurement process".

We are not going to analyze the measurement procedures, the procedures preparing systems
for measurement, or the mechanism governing the modification of initial properties of
systems under various influences (see, e.g., Holevo, 1980, 2001).

Our very simple model with changing random variables permits one to explain easily the
asymptotic behaviour of the frequencies of events studied in Khrennikov, 2001a, 2001b.

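Introduce some notation. Let $|\cdot|$ be the number of elements of a finite set. For
$\omega \in \Omega$, $N \in \mathbb{N}$ and $j = 1, 2$ let

$$q_{jN}(\omega) = |\{i \in \{1, \ldots, N\} : c^{(i)}(\omega) = c_j\}| / N. \tag{6}$$

Set $N_1(\omega) = \max\{i : k_i(\omega) \le N\}$, $N_2(\omega) = \max\{i : m_i(\omega) \le N\}$.
For $\omega \in \Omega$ and $j = 1, 2$ put

$$n_{j1}(\omega) = |\{i \in \{1, \ldots, N_1(\omega)\} : c^{(k_i(\omega))}(\omega) = c_j\}|,$$
$$n_{j2}(\omega) = |\{i \in \{1, \ldots, N_2(\omega)\} : c^{(m_i(\omega))}(\omega) = c_j\}|,$$
$$m_{j1}(\omega) = |\{i \in \{1, \ldots, N_1(\omega)\} : \tilde{c}^{(k_i(\omega))}(\omega) = c_j\}|,$$
$$m_{j2}(\omega) = |\{i \in \{1, \ldots, N_2(\omega)\} : \hat{c}^{(m_i(\omega))}(\omega) = c_j\}|.$$

To simplify the notation (for $j, r = 1, 2$ and $\omega \in \Omega$) we omit the dependence of
$n_{jr}(\omega)$ and $m_{jr}(\omega)$ on $N$.

Evidently, for all $\omega \in \Omega$, $j = 1, 2$ and $N \in \mathbb{N}$ one has

$$q_{jN}(\omega) = \frac{m_{j1}(\omega)}{N} + \frac{m_{j2}(\omega)}{N} + \gamma_{jN}(\omega) \tag{7}$$

where

$$\gamma_{jN}(\omega) = \frac{n_{j1}(\omega) - m_{j1}(\omega)}{N} + \frac{n_{j2}(\omega) - m_{j2}(\omega)}{N}.$$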

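Theorem 1. Let $\{(a^{(j)}, c^{(j)})\}_{j \ge 1}$, $\{(a^{(j)}, \tilde{c}^{(j)})\}_{j \ge 1}$ and
$\{(a^{(j)}, \hat{c}^{(j)})\}_{j \ge 1}$ be sequences of pairwise independent random vectors
identically distributed within each sequence, i.e. for all $j \in \mathbb{N}$

$$(a^{(j)}, c^{(j)}) \overset{D}{=} (a^{(1)}, c^{(1)}), \quad (a^{(j)}, \tilde{c}^{(j)}) \overset{D}{=} (a^{(1)}, \tilde{c}^{(1)}), \quad (a^{(j)}, \hat{c}^{(j)}) \overset{D}{=} (a^{(1)}, \hat{c}^{(1)}),$$

here $\overset{D}{=}$ means the equality of distributions. Assume that (3) - (5) hold true and
$a^{(1)}$ is not degenerate. Then for all $j, r = 1, 2$ and $P$-a.e. $\omega \in \Omega$ there
exist nonrandom limits

$$p_{jr}^{c|a} = \lim_{N \to \infty} \frac{m_{jr}(\omega)}{N_r(\omega)}, \qquad \gamma_j = \lim_{N \to \infty} \gamma_{jN}(\omega), \tag{8}$$

and besides

$$p_j^c = p_{j1}^{c|a} p_1^a + p_{j2}^{c|a} p_2^a + \gamma_j. \tag{9}$$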

Proof. Borel functions of pairwise independent random vectors are pairwise independent.
Therefore, by the Etemadi SLLN (see e.g. Borkar, 1995, p. 69) one has for $P$-a.e.
$\omega \in \Omega$ and $j = 1, 2$

$$q_{jN}(\omega) = \frac{1}{N} \sum_{i=1}^{N} I_{\{c^{(i)} = c_j\}}(\omega) \to E\, I_{\{c^{(1)} = c_j\}} = p_j^c \quad \text{as } N \to \infty, \tag{10}$$

where the indicator function of an event $D$ is

$$I_D(\omega) = \begin{cases} 1, & \omega \in D; \\ 0, & \omega \notin D. \end{cases}$$

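Analogously, with probability one

$$\frac{N_1(\omega)}{N} \to p_1^a \quad \text{and} \quad \frac{N_2(\omega)}{N} \to p_2^a \quad \text{as } N \to \infty. \tag{11}$$

It is clear that

$$\frac{m_{j1}(\omega)}{N} = \frac{1}{N} \sum_{i=1}^{N} I_{\{\tilde{c}^{(i)} = c_j\}}(\omega)\, I_{\{a^{(i)} = a_1\}}(\omega),$$

$$\frac{m_{j2}(\omega)}{N} = \frac{1}{N} \sum_{i=1}^{N} I_{\{\hat{c}^{(i)} = c_j\}}(\omega)\, I_{\{a^{(i)} = a_2\}}(\omega).$$

The same reasoning shows that for $P$-a.e. $\omega \in \Omega$ there exist

$$\lim_{N \to \infty} \frac{m_{j1}(\omega)}{N} = P(\tilde{c}^{(1)} = c_j, a^{(1)} = a_1), \tag{12}$$

$$\lim_{N \to \infty} \frac{m_{j2}(\omega)}{N} = P(\hat{c}^{(1)} = c_j, a^{(1)} = a_2). \tag{13}$$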

The event consisting of those $\omega$ rendering $a^{(i)}(\omega) = a_1$ for all
$i \in \mathbb{N}$ or $a^{(i)}(\omega) = a_2$ for all $i \in \mathbb{N}$ has $P$-measure zero.
So, there exists an event $U$ with $P(U) = 1$ such that both $N_1(\omega) > 0$ and
$N_2(\omega) > 0$ for all $\omega \in U$ and $N > M(\omega)$. Hence for $\omega \in U$ and
$N > M(\omega)$ we have

$$\frac{m_{jr}(\omega)}{N} = \frac{m_{jr}(\omega)}{N_r(\omega)} \cdot \frac{N_r(\omega)}{N}, \quad j, r = 1, 2.$$

Taking into account (11) - (13) we establish the existence of limits $p_{jr}^{c|a}$ in (8), and
due to (10) the validity of the second relation in (8) follows. Now (9) is obvious and the
proof is complete.
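
A quick numerical illustration of Theorem 1 can be sketched in Python as follows. It is only a Monte Carlo sanity check, not part of the proof, and it assumes more than the theorem requires: all variables are drawn independently, the index j = 1 is fixed, and the function name `frequencies` and its parameters are hypothetical. The function returns the empirical counterparts of the quantities entering (9), so that the finite-N identity (7) can be checked exactly and the limiting value of the additional term approximately.

```python
import random


def frequencies(n, p_a1, p_c1, p_mod_a1, p_mod_a2, seed=0):
    """Empirical frequencies for j = 1 in an ensemble of n systems.

    p_a1     -- probability of the first value of the A-characteristic
    p_c1     -- probability of the first value of the initial C-characteristic
    p_mod_a1 -- probability of the first C-value after A showed its first value
    p_mod_a2 -- probability of the first C-value after A showed its second value
    Illustrative assumption: all draws are mutually independent.
    """
    rng = random.Random(seed)
    q_count = m11 = m12 = n1 = 0
    for _ in range(n):
        a_is_1 = rng.random() < p_a1
        c_is_1 = rng.random() < p_c1
        mod1_is_1 = rng.random() < p_mod_a1
        mod2_is_1 = rng.random() < p_mod_a2
        q_count += c_is_1          # counts the initial C-values equal to c_1
        if a_is_1:
            n1 += 1
            m11 += mod1_is_1       # modified C-value among systems with a_1
        else:
            m12 += mod2_is_1       # modified C-value among systems with a_2
    n2 = n - n1
    q = q_count / n
    gamma_n = q - m11 / n - m12 / n   # identity (7) solved for the extra term
    return {
        "q": q,
        "p11": m11 / n1,   # relative frequency m_11 / N_1
        "p12": m12 / n2,   # relative frequency m_12 / N_2
        "pa1": n1 / n,
        "pa2": n2 / n,
        "gamma": gamma_n,
    }


if __name__ == "__main__":
    f = frequencies(100_000, 0.3, 0.5, 0.2, 0.9)
    rhs = f["p11"] * f["pa1"] + f["p12"] * f["pa2"] + f["gamma"]
    print("left-hand side :", f["q"])
    print("right-hand side:", rhs)
    print("gamma estimate :", f["gamma"])
```

With the parameters used in the demo, the additional term should be close to 0.5 - 0.2 * 0.3 - 0.9 * 0.7 = -0.19 for large n, while the two sides of the empirical version of (9) agree up to rounding for every n.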

Remark 1. Successive employment of procedures $\mathcal{M}_A$ and $\mathcal{M}_C$ for systems
$S_j$ and $\tilde{S}_j$ ($j \in \mathbb{N}$) permits one to determine $N_1(\omega)$,
$N_2(\omega)$ and $m_{jr}(\omega)$ for $j, r = 1, 2$. Thus, according to (8) and (11), there are
strongly consistent estimates for $p_{jr}^{c|a}$ and $p_r^a$ ($j, r = 1, 2$). To determine
$q_{jN}(\omega)$ one should apply the procedure $\mathcal{M}_C$ to systems $S_1, \ldots, S_N$.
However, it would change the (distribution of) $A$-characteristics of these systems.
Therefore, to obtain the strongly consistent estimates for parameters $p_j^c$ ($j = 1, 2$) we
can proceed in the following manner. Invoking the principle of reproducibility we can
construct, besides the copies $S_j$ ($j \in \mathbb{N}$) of a system $S$, the copies
$\bar{S}_j$ ($j \in \mathbb{N}$) described by a sequence of random vectors
$\{(\bar{a}^{(j)}, \bar{c}^{(j)})\}_{j \ge 1}$. The standard extension of a probability space
$(\Omega, \mathcal{F}, P)$ can be used to define this auxiliary random sequence. For an
extension we keep the same notation $(\Omega, \mathcal{F}, P)$. Applying the procedure
$\mathcal{M}_C$ to systems $\bar{S}_j$ ($j \in \mathbb{N}$) and defining $\bar{q}_{jN}(\omega)$
analogously to (6) we see that for $P$-a.e. $\omega \in \Omega$ and $j = 1, 2$

$$\bar{q}_{jN}(\omega) \to p_j^c \quad \text{as } N \to \infty.$$

Thus under the conditions of Theorem 1 one can provide the strongly consistent estimates
for the values $\gamma_j$ ($j = 1, 2$) appearing in formula (9).

Remark 2. The notation $p_{jr}^{c|a}$ for limits in (8) seems natural as
$m_{jr}(\omega)/N_r(\omega)$ is the relative frequency of the appearance of the value $c_j$ for
the characteristic of property $C$ in a sequence of those systems which have the value $a_r$
for the characteristic of property $A$. However, the sense of formula (9) is demonstrated much
better by relations (12) and (13) showing that

$$p_{j1}^{c|a} = P(\tilde{c}^{(1)} = c_j \,|\, a^{(1)} = a_1), \quad p_{j2}^{c|a} = P(\hat{c}^{(1)} = c_j \,|\, a^{(1)} = a_2). \tag{14}$$

Thus the additional (with respect to the classical total probability formula) term $\gamma_j$
appears in (9) because the following equalities need not hold true:

$$P(\tilde{c}^{(1)} = c_j \,|\, a^{(1)} = a_1) = P(c^{(1)} = c_j \,|\, a^{(1)} = a_1),$$
$$P(\hat{c}^{(1)} = c_j \,|\, a^{(1)} = a_2) = P(c^{(1)} = c_j \,|\, a^{(1)} = a_2).$$

In such a way it is more informative to rewrite (9) in the form

$$p_j^c = p_{j1}^{\tilde{c}|a} p_1^a + p_{j2}^{\hat{c}|a} p_2^a + \gamma_j, \tag{15}$$

setting now

$$p_{j1}^{\tilde{c}|a} = P(\tilde{c}^{(1)} = c_j \,|\, a^{(1)} = a_1), \quad p_{j2}^{\hat{c}|a} = P(\hat{c}^{(1)} = c_j \,|\, a^{(1)} = a_2), \quad j = 1, 2. \tag{16}$$

Formula (15) clarifies as well that simultaneous measurement of characteristics $A$ and $C$
in general is not supposed.

Remark 3. Due to (15) and (16) one has

$$|\gamma_j| \le 1, \quad j = 1, 2.$$

In the particular case when for each $j \in \mathbb{N}$ the random variables $\tilde{c}^{(j)}$
and $a^{(j)}$ are independent and the same is true for $\hat{c}^{(j)}$ and $a^{(j)}$, Theorem 1
implies that

$$p_j^c = \tilde{p}_j^c p_1^a + \hat{p}_j^c p_2^a + \gamma_j. \tag{17}$$

So, the equality $\gamma_j = 0$ (if $\gamma_1 = 0$ then $\gamma_2 = 0$ and vice versa) is
satisfied if and only if the point $p_j^c$ is appropriately located on the segment with end
points $\tilde{p}_j^c$ and $\hat{p}_j^c$. Evidently, for any given $p_j^a$, $p_j^c$ it is always
possible to indicate $\tilde{p}_j^c$ and $\hat{p}_j^c$ ($j = 1, 2$) to guarantee the relation
$\gamma_j = 0$. Namely, one takes $\tilde{p}_1^c$ in such a way that

$$\max\{0, (p_1^c - p_2^a)/p_1^a\} \le \tilde{p}_1^c \le \min\{1, p_1^c/p_1^a\}$$

and after that chooses $\hat{p}_j^c = (p_j^c - \tilde{p}_j^c p_1^a)/p_2^a$, $j = 1, 2$.
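
The recipe at the end of Remark 3 can be tried out with a few lines of Python. This is a hedged sketch for the independent case (17) only; the function names and argument names are hypothetical, the index j = 1 is fixed, and both values of the A-characteristic are assumed to have positive probability. The first function returns the admissible interval for the modified probability attached to the first A-value, the second returns the matching probability attached to the second A-value that makes the additional term vanish.

```python
def zero_interference_interval(p_c1, p_a1):
    """Admissible range of the first modified probability (case gamma_1 = 0).

    p_c1 -- probability of the value c_1 for the initial C-characteristic
    p_a1 -- probability of the value a_1, assumed to lie strictly in (0, 1)
    """
    p_a2 = 1.0 - p_a1
    low = max(0.0, (p_c1 - p_a2) / p_a1)
    high = min(1.0, p_c1 / p_a1)
    return low, high


def matching_second_probability(p_c1, p_a1, p_mod_a1):
    """Second modified probability solving p_c1 = p_mod_a1*p_a1 + x*(1 - p_a1)."""
    return (p_c1 - p_mod_a1 * p_a1) / (1.0 - p_a1)


if __name__ == "__main__":
    p_c1, p_a1 = 0.5, 0.3
    low, high = zero_interference_interval(p_c1, p_a1)
    p_mod_a1 = 0.5 * (low + high)            # any point of the interval works
    p_mod_a2 = matching_second_probability(p_c1, p_a1, p_mod_a1)
    gamma_1 = p_c1 - p_mod_a1 * p_a1 - p_mod_a2 * (1.0 - p_a1)
    print(low, high, p_mod_a2, gamma_1)
```

Choosing the first probability inside the returned interval is exactly what keeps the second one between 0 and 1; outside the interval the formula still returns a number, but it is no longer a probability.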

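Remark 4. It is possible to write formula (15) analogously to (2) using the auxiliary
parameters $\lambda_j$ determined by the relation

$$\gamma_j = 2\sqrt{p_{j1}^{\tilde{c}|a}\, p_1^a\, p_{j2}^{\hat{c}|a}\, p_2^a}\;\lambda_j, \quad j = 1, 2.$$

Note that in general $\lambda_j$ can take any real values. In the particular case described by
formula (17) we establish that

$$|\lambda_j| \le 1 \iff |t_{j1} - t_{j2}| \le \sqrt{p_j^c} \le t_{j1} + t_{j2},$$

where $t_{j1} = \sqrt{\tilde{p}_j^c\, p_1^a}$, $t_{j2} = \sqrt{\hat{p}_j^c\, p_2^a}$ ($j = 1, 2$).
If, moreover, $\tilde{p}_j^c = \hat{p}_j^c$ then

$$|\lambda_j| \le 1 \iff \left(1 - 2\sqrt{p_1^a p_2^a}\right)\tilde{p}_j^c \le p_j^c \le \left(1 + 2\sqrt{p_1^a p_2^a}\right)\tilde{p}_j^c. \tag{18}$$

The last relation shows that the possibility to represent $\lambda_j$ as the cosine of some
angle in this special case means that the distribution of $\tilde{c}^{(1)}$ is obtained from
that of $c^{(1)}$ by a "bounded perturbation".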

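The criterion of Remark 4 can be checked numerically with the following Python sketch. It covers only the particular case (17), where the probability of a fixed C-value before the measurement equals $t_1^2 + t_2^2 + 2 t_1 t_2 \lambda$, so that $|\lambda| \le 1$ holds exactly when $(t_1 - t_2)^2$ does not exceed that probability and the probability does not exceed $(t_1 + t_2)^2$. The function names and argument names are hypothetical, and all probabilities passed in are assumed to lie strictly between 0 and 1 so that no division by zero occurs.

```python
import math


def lam(p_c, p_mod_a1, p_mod_a2, p_a1):
    """Auxiliary parameter lambda in the independent case (17), for a fixed j.

    p_c      -- probability of the value c_j before any measurement
    p_mod_a1 -- probability of c_j after the first A-value was observed
    p_mod_a2 -- probability of c_j after the second A-value was observed
    p_a1     -- probability of the first A-value
    """
    t1 = math.sqrt(p_mod_a1 * p_a1)
    t2 = math.sqrt(p_mod_a2 * (1.0 - p_a1))
    gamma = p_c - t1 ** 2 - t2 ** 2
    return gamma / (2.0 * t1 * t2)


def cosine_representable(p_c, p_mod_a1, p_mod_a2, p_a1):
    """True when lambda can be written as the cosine of some angle."""
    t1 = math.sqrt(p_mod_a1 * p_a1)
    t2 = math.sqrt(p_mod_a2 * (1.0 - p_a1))
    return abs(t1 - t2) <= math.sqrt(p_c) <= t1 + t2


if __name__ == "__main__":
    for args in [(0.5, 0.2, 0.9, 0.3), (0.99, 0.1, 0.1, 0.5)]:
        print(args, lam(*args), cosine_representable(*args))
```

The second parameter set in the demo gives a value of the auxiliary parameter far above 1, which illustrates the statement that in general it can take any real value and need not be a cosine.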
Remark 5. It is not difficult to complicate the scheme of measuring characteristics of
properties $A$ and $C$. For instance, one can assume that procedures $\mathcal{M}_A$ and
$\mathcal{M}_C$ do not permit one to fix exactly in all experiments (i.e. for all systems) the
values of characteristics for properties $A$ and $C$. Then for the validity of Theorem 1 it is
sufficient to suppose that the number of faulty (or "nonsuccessful") measurements is $P$-a.e.
$o(N)$ (as $N \to \infty$) among the systems with labels $1, \ldots, N$. One can obtain the
analogue of Theorem 1 assuming (see e.g. Khrennikov, 1999, p. 66) that prior to measurement
procedures $\mathcal{M}_A$ and $\mathcal{M}_C$ one has to implement some preparation
procedures $\mathcal{P}_A$ and $\mathcal{P}_C$ (modifying in an appropriate manner the
properties of systems under consideration).